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Abstract: This paper considers systems subject to nonholonomic constraints 
which are not uniform on the whole configuration manifold. When the con- 
straints change, the system undergoes a transition in order to comply with the 
new imposed conditions. Building on previous work on the Hamiltonian the- 
ory of impact, we tackle the problem of mathematically describing the classes 
of transitions that can occur. We propose a comprehensive formulation of the 
Transition Principle that encompasses the various impulsive regimes of Hamil- 
tonian systems. Our formulation is based on the partial symplectic formalism, 
which provides a suitable framework for the dynamics of nonholonomic systems. 
We pay special attention to mechanical systems and illustrate the results with 
several examples. 

Keywords. Mechanical systems, variable nonholonomic constraints. Transi- 
tion Principle, impulsive motion 

1. Introduction 

In this paper we consider the problem of mathematically describing impulsive 
motions (impacts, collisions, reflection, refractions) of Hamiltonian systems sub- 
ject to nonholonomic constraints. An impulsive behavior takes place when one 
or more of the basic ingredients of the Hamiltonian dynamical picture undergoes 
a drastic change. As an example, one may consider the instant of time when the 
configuration space of the system collapses instantaneously because of an inelas- 
tic collision. Another example is given by a ray of light that splits into reflected 
and refracted rays when passing from one optical media to another, and so on. In 
situations like these, the phase trajectory of the system becomes discontinuous 
and the problem of how to describe this discontinuity arises. 

The problem of describing impulsive motion has been extensively studied in 
classical books such as Pll?7ii29ii30ii33) . In these references, the emphasis is put 
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on the analysis of mechanical systems subject to impulsive forces, and in par- 
ticular, the study of rigid body collisions by means of Newton and Poisson laws 
of impact. Impulsive nonholonomic constraints (i.e. constraints whose reaction 
force is impulsive) are also considered in [^,33 , and, from a geometric per- 
spective, in more recent works such as [1^11171120} . If impulsive constraints and 
impulsive forces are present at the same time, Newton and Poisson approaches 
have been revealed to be physically inconsistent in certain cases P^35| . This sur- 
prising consequence of the impact laws is only present when the velocity along 
the impact surface stops or reverses during collision due to the friction. En- 
ergetically consistent hypothesis for rigid body collisions with slip and friction 
are proposed in [34 , 35 . From a design point of view, the interest in systems 
subject to impulsive forces is linked to the emergence of nonsmooth and hybrid 
dynamical systems in Control Theory, i.e., systems where continuous and dis- 
crete dynamics coexist, see [7)IH|l??|l37| and references therein. Hybrid mechanical 
systems that locomote by switching between different constraint regimes and 
are subject to elastic impacts are studied in J^. Hyper- impulsive control of 
mechanical systems is analyzed in |16| . 

Here, we aim at a comprehensive analysis of the various situations which can 
occur concerning impulsive regimes of nonholonomic Hamiltonian systems. In 
particular, we focus on two different but complementary cases. The first one 
deals with a drastic change in the nonholonomic constraints imposed on the sys- 
tem. The second one concerns a drastic change of the Hamiltonian function and 
includes, in particular, collisions and impacts of nonholonomic systems. The 
proposed solution is given in terms of a generalized version of the Transition 
Principle. This principle, sketched for the first time in a series of lectures of 
the second author jSj for discontinuous Hamiltonians, was recently extended to 
other non-constrained situations in |31ll^) (see also JS] for a related discussion 
in an optimal control setting). By its very nature, the Transition Principle is a 
direct dynamic interpretation of the geometric data of the problem. This feature 
distinguishes it from other approaches. For instance, in Classical Mechanics, the 
velocity jumps caused by an impact are traditionally derived from some assump- 
tions on the nature of the impulsive forces (see, for instance, E1E2I)- However, 
these assumptions are not logical consequences of the fundamental dynamical 
principles and therefore one should really consider them as additional principles 
for impulsive problems. The distinguishing feature of the Transition Principle 
is that it gives full credit to the geometry of the nonholonomic Hamiltonian 
system. This seems reasonable for the impulsive regime, keeping in mind the 
perfect adequacy of the Hamiltonian description to the dynamical behavior of 
the system in the absence of impulsive motions. In addition, there are some 
noticeable advantages deriving from the Transition Principle. First of all, its ap- 
plication gives an exact and direct description of the post-impact state which is 
of immediate use for both theoretical and computational purposes. Secondly, it 
is still valid in some situations where standard methods can be hardly applied. 
In particular, this is the case of Hamiltonian systems describing the propaga- 
tion of singularities of solutions of partial differential equations (consider, for 
instance, the example of geometrical optics) 24J2Il.3i&i ■ Clearly, no variational 
or traditional approach can be applied to this very important class of systems. 

A second contribution of this paper concerns the formulation of the dynamics 
of nonholonomic Hamiltonian systems. We make use of the notion of partial 
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symplectic structures introduced in 6 and relate this framework with other 
modern approaches to nonholonomic systems (see p^HII MT!illiyil21ll2fill86| and 
references therein). One advantage of the partial symplectic formalism is that 
it allows us to draw clear analogies between the unconstrained and constrained 
situations. Another advantage is that the treatment of nonlinear constraints can 
be easily incorporated. 

The paper is organized as follows. Section [3 introduces some geometric pre- 
liminaries on distributions, constraint submanifolds and partial symplectic struc- 
tures. In Section |31 we show how any nonholonomic Hamiltonian system pos- 
sesses an associated partial symplectic structure, and we use this fact to in- 
trinsically formulate the dynamics. We also analyze systems with instantaneous 
nonholonomic constraints and systems exhibiting discontinuities. In Sectional 
we develop a new formulation of the Transition Principle for systems with con- 
straints. We present the novel notion of focusing points with respect to a con- 
straint submanifold and we also introduce the concept of constrained character- 
istic. Decisive points are defined for each impulsive regime resorting to in, out 
and trapping points. Section [S] presents a detailed study of the concepts intro- 
duced in the previous sections in the case of mechanical systems. We compute 
the focusing points and the characteristic curves, and present various results con- 
cerning the decisive points. We also prove an appropriate version for generalized 
constraints of the classical Carnot's theorem for systems subject to impulsive 
forces: if the constraints are linear, we show that the Transition Principle always 
implies a loss of energy. We conclude this section by showing that if the con- 
straints are integrable, then our formulation of the Transition Principle recovers 
the solution for completely inelastic colhsions [32] ■ Finally, Section presents 
various examples of the application of the above-developed theory. 

To ease the exposition, below we make use of the standard notation concerning 
differential geometry and the Hamiltonian formalism without making explicit 
reference to any work. In particular, we denote by (resp., D{M)) the C°°{M)- 
module of ith order differential forms (resp., of vector fields) on a manifold M. 
We use F*{tp) to denote the puUback with respect to a smooth map of a 
function or differential form (p. If a; is a point of M , then the subscript x refers 
to the value of the corresponding geometric object at x. For instance, stands 
for the vector field vector X G D{M) evaluated at x. The interested reader may 
consult classical books such as 1 .18.23 for further reference. We also assume 
smoothness of all the objects we are dealing with. 



2. Preliminaries 

In this paper we deal with Hamiltonian systems defined on the cotangent bundle 
T*M of an n-dimensional manifold M. In the particular case of a mechanical 
system, M and T*M are, respectively, the configuration space and the phase 
space of the system. As usual, ttm : T*M ^ M (or simply vr) stands for the 
canonical projection from T*M to M, H e C°°{T*M) for the Hamiltonian 
function and Xh £ D{M) for the corresponding Hamiltonian vector field. The 
canonical symplectic structure on T*M is denoted hy Q = Qm- In canonical 
coordinates {q°',pa), a — 1, . . . ,n oi T*M, the symplectic form reads f2 — dq^ A 
dp a- 
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We say that the Hamiltonian system (M, H) comes from a Lagrangian system 
{M,L) onTMiiH ^ {Cl)'^ {El), where El G C°°{TM) is the energy function 
corresponding to the (hyper- regular) Lagrangian function L 6 C°°(TM) and 
£l '■ TM — * T*M is the associated Legendre map. 

If X is a vector field on T*M, then the map ax : T*M TM defined by 

ax{e) - den{Xg) e T^(e)M , e T*M , 

denotes the anti-Legendre map associated with X. In standard coordinates, if 
X = A''{q,p)^+B-{q,p)^^, then ax reads ax{q'',Pa) = {q\A-{q,p)). For 
the Hamiltonian vector field X = Xh, we write aH instead of axni so that 

an ■■ {q,p) ^ i^l^'"^^ 

It is not difficult to see that if the Hamiltonian system (Af , H) comes from a 
Lagrangian system (M, L), then an = ■ 



2.1. Distributions and codistributions. Recall that a distribution (resp., codistri- 
bution) on a manifold M is a vector subbundle of TM (resp., of T*M). The 
annihilator of a distribution T> on M is the codistribution Ann(I?) defined by 

Ann(I?), ^{Oe T*M \ 0(0 = 0, £ , xeM. 

If I? is (rt — m)-dimensional, the codistribution Ann(P) is m-dimensional. The 
dual bundle T>* of T) is canonically identified with the cotangent bundle T*M 
modulo Ann{T>). We will also denote by T)^ the orthogonal complement of a 
distribution T> on T* M with respect to the symplectic form il, i.e., 

= {C e Ty{T*M) I ny{^, 77) - , V77 e , 2/ e r*M . 

A vector field X e D{M) belongs to P if e P^ for all a; G M. Vector fields 
belonging to P constitute a C°°(M)-module, denoted by Dt>{M), which is a 
submodule of D{M). In the partial symplectic formalism (see Section[^lbelow) , 
they are interpreted as "constrained" vector fields. Dually, denote by A\j{M) 
the C°° (M)-module of sections of the bundle P* and by A'!^{M) its ith exterior 
product. These are interpreted as "constrained" differential z-forms. We denote 
the natural restriction map from A'{M) to yl|,(M) by : A^M) A^,{M). 

The geometric description of nonholonomic systems in the framework of the 
partial symplectic formalism ,6, requires a slight "affine" generalization of these 
standard notions. Namely, an affine distribution on a manifold M is an affine 
subbundle A of TM. This means that the fiber of A over x S M is an affine 
subspace in T^M. Therefore, A^ can be represented in the form Aj; = v + A^ 
with V € TxM and being the vector subspace of T^M canonically associated 
with Ax. In this representation, the displacement vector v is unique modulo 
Z\°. The union (JxeMA^ constitutes a linear distribution of the tangent bundle 
TM, denoted by Z\°, canonically associated with A. It is not difficult to see 
that there always exist a vector field Y G D{M) such that Y,, is a displacement 
vector for A^. Such vector fields are called displacement vector fields of A. Ob- 
viously, displacement vector fields differ by another vector field belonging to A. 
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In coordinate terms, an (n — m)-dimensional afRnc' c-odistribution is described 
by a system of linear equations = with respect to the variables Pa, i.e., 
^i(q,p) = 'Pia{q)Pa +^io{q), i=l,...,m. 

Similarly, an ajfine codistribution on M is an afhnc subbundlc C <Z T* M 
of the cotangent bundle. As above, one has C = T + C°, where C° is the 
unique codistribution on M canonically associated with C, and Y & (M) is a 
displacement form. Point-wisely this means that Cx =Tx + C^, for all a; G M. 

2.1.1. Linear constraints. In the case of linear constraints, the analogy between 

free and constrained systems is particularly clear. In fact, it is natural to in- 
terpret an afSne distribution (resp., codistribution) on a manifold M as the 
"constrained" tangent (resp., cotangent) bundle of M. A linearly constrained 
Hamiltonian system is then a triple (M, H, C), with H e C°°{T*M) and C an 
afiine codistribution on M. Similarly, a triple (M, L,Z\), with L e C°°{TM) 
and A an afiine distribution on M , is a linearly constrained Lagrangian system. 
The anti-Legendre map allows one to pass from a constrained Hamiltonian sys- 
tem to the corresponding Lagrangian system and vice versa. More precisely, if 
(M, _ff, C) is a linearly constrained Hamiltonian system, the map an is linear 
and {M,H) comes from a Lagrangian system {M,L), then the corresponding 
linearly constrained Lagrangian system is (M, L, A), with A ~ aniC). To go in 
the opposite direction, one must use the Legendre map £l instead of Uh- 

Throughout the paper, we distinguish the class of mechanical systems subject 
to linear constraints because of two reasons. First, classically they have been 
intensively studied. Second, one can extract from them the motivations for the 
basic constructions which will be discussed below. 

2.1.2. Nonlinear constraints. In the Hamiltonian setting, the nonholonomic con- 
straints are given by a submanifold (not necessarily a vector subbundle) C C 
T*M. Similarly, nonholonomic Lagrangian or kinemMic constraints are given 
by a submanifold C C TM . If the Hamiltonian system [M, H) comes from 
a Lagrangian system {M,L), then C = Cl{C') if and only if C = a_f/(C). 
In mechanics, these two approaches correspond to two possible descriptions of 
nonholonomic constraints: either as limitations imposed on the momenta or as 
limitations imposed on the velocities, respectively. The fact that C (resp., C) 
represents limitations imposed only on the momenta (resp., velocities), but not 
on the configurations of the system, implies that the projection tt must send 
C (resp., C") surjectively onto M. However, the assumption of "infinitesimal 
surjectivity" of ■n\c is more adequate in this context. This means that tt\c is a 
submersion, i.e., dy{'K\c) ■ TyC T.,^(y)M is surjective for all y G C. With this 
motivation, we adopt the following definition. 

Definition 1. A set of nonholonomic constraints imposed on a Hamiltonian 
system (M, H) is a submanifold C C T*M such that n\c is a submersion. The 
constrained Hamiltonian system is denoted by {M, H, C) . 

Since C and aniC) are, respectively, interpreted as the constrained cotangent 
and tangent bundle of the system {M,H,C), we will always assume that they 
have equal dimensions. It is worth stressing that the above definition makes also 
sense for manifolds with boundary. In such a case, the boundary of T*M is 
■K-\dM) and the boundary of C is 5C = C n n-\dM). 
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Remark 1. Similarly, nonholonomic Lagrangian constraints are represented by 
submanifolds of TM that project regularly onto M. 

In what follows, 'Pi{q,p) = 0, i = 1, ...,m, will denote a set of local equations 
defining C. For a point x € M, we denote by Cx the fiber of C at x, 

Cx = Cn T;M = {y€C\ 7T{y) ^ x} ^ {nic)-\x) . 

2.1.3. Instantaneous nonholonomic constraints. Let A'^ be a hypersurface in M. 
Consider the induced hypersurface T^M = tt-^{N) of T*M. Let C C T*M 
be a set of nonholonomic constraints on M. Instantaneous constraints may be 
thought as limitations on the momenta (resp., velocities) of the system that 
are imposed only at the instant when a trajectory passes through a point of 
N. Therefore, they are represented by a submanifold C™''* of C n T^M. These 
constraints are assumed to be additional to the ones already prescribed by C. In 
order to admit an adequate mechanical interpretation, we also assume that the 
projection tt restricted to C™''' is a submersion onto N. From the Lagrangian 
point of view, instantaneous kinematic constraints are naturally interpreted as a 
submanifold C™*** of TN. Based on these considerations, we take the following 
definition. 

Definition 2. Let (M, H, C) be a constrained Hamiltonian system and let N be 
a hypersurface of M . A set of instantaneous constraints along N imposed on 
(M, H, C) is a submanifold C™"* of C Ci T'^M such that it restricted to C™"' is 
a submersion onto N . 

It is worth stressing that, in some cases, a set of instantaneous constraints 
along N additionally verifies the condition (7™''' c (TN) (here TN is thought 
to be naturally embedded into TM). In an inelastic scenario, where the non- 
holonomic motion in M is forced to take place in N after the impact, this latter 
condition formalizes the parity between the Hamiltonian and Lagrangian ap- 
proaches: if the Hamiltonian system in question comes from a Lagrangian one, 
then (7'"''* = £l(C""''*'), with C™^^' being the instantaneous kinematic con- 
straints. 



2.2. Dynamics of Hamiltonian systems. As is well-known, in the absence of con- 
straints, the dynamics of the Hamiltonian system (Af, H) is given by the Hamil- 
tonian vector field Xu^ whose coordinate description is 

dq" _ dH dpa _ dH _^ 
dt dpa ' dt dq°- ' 

In the presence of constraints, the "free" Hamiltonian vector field Xh must be 
modified along the constraint manifold C in order to become tangent to C. In 
the traditional approach this goal is achieved by adding to Xh another vector 
field along C, say, R, interpreted as the reaction of constraints. From a purely 
geometrical point of view, the choice of a vector field that makes Xh tangent to 
C is far from being unique. Therefore, a new principle must be invoked to select 
the one that merits to be called the "reaction of constraints" . The history of this 
problem (see, for instance, |5B)) shows that its solution is not straightforward. 
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By applying, for instance, the Lagrange-d'Alenibert principle (see pii;^ll28) V one 
gets the following equations of motion 

dq- dH dp, dH d<P, 

a = l,...,rt, i — l,...,m, where the "Lagrange multipliers" A^'s are to be 
duly determined. Shortcomings of such an approach are that it is not manifestly 
intrinsic and does not reveal clearly the geometric background of the situation. 
This is why in our further exposition we shall follow a purely geometric approach, 
which does not require any discussion of reactions of constraints. It is based 
on the concept of partial symplectic formalism, which also appears to be more 
concise from an algorithmic point of view. 



2.3. Partial symplectic structures. The following elementary facts from linear 
algebra will be most useful. Let V he a vector space, W C V & subspace and 
b : V xV ^ M. a. bilinear form on V. Denote by the 6-orthogonal complement 
of W, 

Wj^ = {veV\ b{v,w) = 0,\/w e W}. 

Note that W fl = if and only if the restriction 6 of 6 to is non- 
degenerate. The form b is said to be nondegenerate on an affine subspace U of 
V, U — po + W, po G V, if it is nondegenerate on its associated vector space 
W. In such a case, U can be uniquely represented in the form U — pi + W 
with pi G due to the fact that U f) = {pi}. The vector pi is called the 
canonical displacement of U with respect to b. Consider the associated map 

Tw,b-W — >W*, Tw.bH ^b{w,-), w €W. 

In other words, TiY^b{w){w') — b{w,w'), for all w' £ W. Obviously, Tw,b is an 
isomorphism if and only if b\w is nondegenerate. 

If b is skew-symmetric and nondegenerate on V, and is a subspace of V 
with codimension one, then the kernel of the restricted form b\w, kerfej^y is a 1- 
dimensional subspace, i.e., a line in V contained in W. Therefore, ker = Wb'- 

Let now A be an affine distribution on a manifold Q. A form oj G A^{Q) 
is called nondegenerate on A if b — lOx is nondegenerate on U — A^, for all 
X G Q. In such a case, there exists a unique vector field Y G D{Q) such that 
Yx is the canonical displacement of A^ with respect to ojx, for all x G Q. The 
vector field Y = Ya,u is called the canonical displacement of A with respect 
to If ijj G A^{Q) is nondegenerate on A, then one has the isomorphism of 
vector bundles 

7 = lAo,. ■■ A^* A\ 7. = -(Tao,^J-i : zif A^ 

Passing to sections of these bundles, one gets the isomorphism of C°°((5)-modules 
^A«,^ : A\o{Q) ^ D^.{Q) defined by 

r^rAO^^{Q){x)='l{Q{x)), QGA\o{Q). (1) 

Definition 3. A partial symplectic structure on a manifold Q is a pair {A,lj) 
consisting of an affine distribution A on Q and a closed 2-form lo G A^{Q) which 
is nondegenerate on A. 
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Given a partial symploctic structure Q = {A,uj), we will use the subscript 
to denote the associated objects: Z\© = A,u)0 = co,Y@ = Ya,oj and zi@ for the 
distribution canonically associated to A. We also write 

In the partial symplectic formalism, the elements of C°° ((5)-niodules Dq and A^ 
may be viewed as "constrained" vector fields and differential forms, respectively. 
The constrained Hamiltonian vector field associated with a Hamiltonian function 
H G C°°(Q) is defined as 

X^ = re{re{dH)) + Ye. (2) 

The almost-Poisson bracket associated to the partial symplectic structure is 

{/, <7}e = re(re(rf/))(5) = Xf - Yeig) , f,gG C°°(Q) . 

The wording "almost" here refers to the fact that this bracket does not satisfy 
in general the Jacobi identity. However, it is still skew-symmetric and a bi- 
derivation. 

Definition 4. Let = {A, uj) be a partial symplectic structure on a manifold 
Q. A hypersurface B c Q is transversal to (or to A) if the affine suhspaces 
TyB and Ay of TyQ are transversal for any y € C. 

If B is transversal to 0, then TyB D Ay is of codimension 1 in Ay. If is a 
partial symplectic structure on C C T*M, we shall extend this terminology by 
saying that is transversal to a hypersurface B in T*M if B is transversal to 
C, so that B = BCiC is a. hypersurface in C, and B is transversal to 0. 

3. Dynamics of nonholonomic Hamiltonian systems 

In this section, we formulate the dynamics of nonholonomic Hamiltonian systems 
using the partial symplectic formalism. We show how, under some technical con- 
ditions, any Hamiltonian system subject to nonholonomic constraints possesses 
an associated partial symplectic structure. Then, we analyze the cases of systems 
with instantaneous nonholonomic constraints, and systems exhibiting disconti- 
nuities. 



3.1. The partial symplectic structure associated with a constrained Hamiltonian 
system. Let (M, H, C) be a constrained Hamiltonian system. Our goal is to as- 
sociate with it a partial symplectic structure on the "constrained" cotangent 
bundle C in such a way the corresponding constrained Hamiltonian field 
gives the desired nonholonomic dynamics. With this purpose, consider the con- 
strained symplectic form defined by the restriction of the "free" symplectic form 
Qm to C 

iO0=j*{f2M), (3) 

with j : C ^ T*M the canonical inclusion. The next step is to construct a 
suitable affine distribution Aq on C. A natural non-singularity requirement on 
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C is asking for the regularity of the map aH\c- This is the reason why we assume 
that aH\c is an immersion, i.e., that the differential dyan is nonsingular for any 
y € C. Since an ■ T*M TM is fibered, this assumption implies that the map 
{<xh)x ■ Cx ^ TM is an immersion for any x G Af and vice versa. 

Let y & C and x = TT{y). Let Uy be the affine subspace of T^M tangent to 
oih{Cx) at z = otH{y)- Since by the above assumption aH\c is an immersion, 
dimiJy = (XimCx = n — m. Consider the afhne distribution Aq on C defined by 

Aoy = {e e TyC I dyiriO e Hy} C TyC. (4) 

Since dy{T:\c) is surjective, the codimension of A^y in TyC is equal to the 
codimension of Uy in T^M , i.e., to m. Therefore, dim A^y = 2(n — to). It is not 
difficult to see now that if the form loq is nondegeneratc on the distribution Aq , 
then anlc is an immersion. 

Proposition 1. Let S — {M, H, C) he a constrained Hamiltonian system. Then, 
ctffic is an immersion if the pair (Aq^ujq) defined by equations © and Q is 
a partial symplectic structure. 

The converse, however, is in general not true. Since the partial symplectic 
structure associated with S = (M, H, C) is determined by H and C, we will 
simply denote it by 0{H,C) = (Ah^Ct^^c)- 

For most Hamiltonian systems (including those coming from Mechanics), the 
anti-Legendre map an is regular not only when restricted to C, but on the 
whole space T* M . In this is the case, and the Hamiltonian system comes from 
a Lagrangian system, one can indeed show that the condition of being non- 
degenerate on the distribution Aq is equivalent to the so-called compatibility 
condition pil21| . Therefore, Proposition ^ establishes a link between the classi- 
cal partial symplectic formalism introduced in |S] and more recent approaches 
as explained, for instance, in ^Jl- Also note that the class of mechanical systems 
automatically verifies the compatibility condition, therefore admitting both for- 
mulations. Indeed, for mechanical systems, the conditions in Proposition ^ are 
equivalent. 

Definition 5. A nonholonomic Hamiltonian system on a manifold M is a con- 
strained system {M,H,C), H e C°°{T*M), C C T*M such that e{H,C) = 
(Ah.c^c) is a partial symplectic structure. 

The dynamics of a nonholonomic Hamiltonian system is given by the con- 
strained Hamiltonian vector field with respect to the partial symplectic 
structure = 0{H,C) (cf. equation (O). This vector field will be denoted by 
Xh,c- Under regularity of the map an, Xh.c reads in canonical coordinates 

^ _dH d fdH ^ /'dHd'P^ dH d^^\ d<P'^ \ d 
"''^ dpadq" \dq°- \dpb dq'' dq^ dpb ) dpc "j dpa ' 

where the matrices {TLab) and (Cy ) are defined by 
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Observe that the force of reaction of nonholonomic constraints (see Section 1^21 
in the partial symplectic framework is defined a posteriori as the difference 
between the constrained and the free Hamihonian vector fields, Xh,c ~ Xh- 
Also, note that the almost-Poisson bracket associated with the partial symplectic 
structure 0h,c coincides with the so-called nonholonomic bracket [OIT^ . 

Transversality. It is convenient to adapt the terminology related to the notion 
of transversality discussed in Section to the context of nonholonomic Hamil- 
tonian systems. First, we shall say that a nonholonomic Hamiltonian system 
5' = {M, H, C) is transversal to a hypersurface B in T* M if the underlying 
partial symplectic structure 0{H, C) is transversal to B. Second, if is a hy- 
persurface in M, we shall say that S is transversal to N if S is transversal to 
the hypersurface T^M. The following result follows from the definition of the 
partial symplectic structure 0{H, C). 

Proposition 2. A nonholonomic Hamiltonian system {M, H, C) is transversal 
to a hypersurface N C M if and only if Uy C T^^^y^M , the affine subspace of 
TT^(^y)M tangent to a_ff(C7r(y)) at z ^ aniy), is transversal to TT^(^y)N C T^^^^y^M 
for all y G C. 

3.2. Instantaneous partial symplectic structures. It is intuitive to think that 
when a trajectory of a Hamiltonian system (M, H, C) crosses a critical hyper- 
surface N in the configuration manifold M, its phase space reduces to T*N. 
Moreover, it could possibly be subject to additional instantaneous constraints 
along N. In the language of our approach, this idea is naturally expressed by 
saying that all such critical states constitute a nonholonomic Hamiltonian sys- 
tem on N. Since T*N is not naturally embedded into T*M, a realization of 
this idea is not completely straightforward. What one really needs is a partial 
symplectic structure on the manifold of instantaneous constraints C™*** which, 
by definition, is a submanifold of T*M 

Namely, let C™^^ be a set of instantaneous constraints along N imposed on 
{M,H,C) (cf. Definition 0). Take y e C'""'. Let x = TT{y) and denote by 77^""' 
the affine subspace ofT^N C T^M tangent to a/f (C^"^') at aniy). Consider the 
2-form o^Qinst and the afhne distribution Zigiinat on C""*** defined by 

^e>- = fi^M) , Ae....y = {C e T.C"-* | d,7r(0 e n^^'} c r,C'-', (5) 

with j : C^'^^^ ^ T*M the canonical inclusion. We then have the following 
definition. 

Definition 6. Let (M, H, C) he a nonholonomic Hamiltonian system and let C*'"*'' 
he a set of instantaneous constraints along a hypersurface N d M . The pair 
(zigmst, ujQinat) dcfincd by (O is called the instantaneous partial symplectic struc- 
ture along TV if ujQm.,t is not degenerate on Zigmst. If this is the case, C*™"* is called 
a regular set of instantaneous constraints. 

Note that this structure is defined by H, C""*** and N . To highhght this fact, 
we denote 0'""' = 6»'""'(i?, C'""*, iV). Accordingly, we denote by X(h,c^^-\n) the 
constrained Hamiltonian vector field A^l"^*, with H™^^ ~ iJl^inat. 
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In general, since C""'- c C by definition, one has that anlcjinst is an immer- 
sion. For mechanical systems, this implies that the 2-form Wginst is nondegenerate 
on zigiinst, and therefore any set of instantaneous constraints is regular. 

In what follows, wc shall only deal with regular instantaneous nonholonomic 
constraints. A natural class of instantaneous structures arises in the following 
situation of particular interest. Assume that the nonholonomic Hamiltonian sys- 
tem S = (M, H, C) is transversal to N and that Uh is regular. Then (TN) 
is transversal to C and, hence, 

C^N,H)=^H'iTN)nC 

is a submanifold of codimension 2 in C. Note that C^^^^h) is a set of instantaneous 
nonholonomic constraints on S along N. By construction, the codimension of 
ziATj, = Z\j, nTj,(C(jv,_f/)) ill is also 2 and fiy is nondegenerate when restricted 
to Therefore, the afHne distribution An and the 2-form i7|jv endow C(^n,h) 
with a partial symplectic structure, which is an instantaneous partial symplectic 
structure along A''. We call it the trace of S on N and denote it by Sf^jf jjy 
In the special case N = dM, we call it the boundary of S, and denote it by 
dS, i.e., dS = S(^qm,h)- We will denote the constrained Hamiltonian vector field 
with respect to the trace (resp., boundary) as X*"" = X^^jj c n) (I'esp., = 



3.3. Discontinuous nonholonomic systems. An impulsive behavior of a Hamil- 
tonian system occurs when its trajectory "tries" to go across a critical hyper- 
surface N in the configuration space M . In such an instant, the system may 
be forced to drastically change its constraints, to pass under the control of an- 
other Hamiltonian and/or to be eventually subject to additional instantaneous 
constraints. Such situations may be interpreted as discontinuities on both the 
constraints and the Hamiltonian of the system. Below, we formalize these con- 
cepts properly via the notion of cutting- up. 

Definition 7. Let N c M be a hypersurface of M with N n dM = 0. ^ pair 
(M, <r), <; : M — > M, is called a cutting- up of M along N if 

(i) N ^c-'^{N) cdM; 

(ii) maps M\N diffeomorphically onto M \ N; 
(Hi) (^Ij^ : N ^ N is a double covering of N. 

Note that, by definition, <; is a local diffeomorphism. Cuttings-up for a given 
N exist and are equivalent one to each other. If A'' divides M into two parts, 
say, M+ and Af _ , i.e. , M = M+ U M_ , M+ n M_ = N, then M may be viewed as 
the disjoint union of M_|_ and M_, and <; as the map that matches them together 
along the common border N. Locally any cutting-up is of this form. 

For our purposes, it is important to realize that, if (M, <r) is a cutting-up of M 
along A^ , then {T*M, T*<;) is a cutting-up of T*M along the hypersurface r;^M. 
Here, T*c; denotes the dual of the inverse of the isomorphism d^c : T^M — > 
T^i^^-jM, for all z G M. In the following definition, wc introduce the class of 
Hamiltonian systems we shall be dealing with throughout this paper. 
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Definition 8. Let N C M be a hypersurface of M with N fl dM = and 
let {M,<;) be a cutting-up of AI along N. A nonholonomic Hamiltonian system 
discontinuous along N , denoted S = (M, H, C \ N) , is the direct image with 
respect to T*<; of a nonholonomic Hamiltonian system {M,H,C). Such system 
is called regular if{AI,H,C) is transversal to N. 

A system of instantaneous nonholonomic constraints on S along N is the 
direct image with respect to T*(, of a set of instantaneous constraints C™'** along 
N on the associated system S — {AI^H,C \ N). The trace of 5 on is the 
direct image with respect to T*(, of the trace of S along N . 

According to Definition |H1 H is a. smooth function on T*M and C is a sub- 
manifold of T*AI. Therefore, the direct image of H along the matching map 
T*<^ : T*M — » T*AI may be viewed as a function on T*M, which is 1-valued and 
smooth outside of T^AI and 2-valued and smooth on T^M . We will continue 
to use the notation H for this function and will refer to it as a discontinuous 
Hamiltonian along N . Similarly, the direct image C = T*^(C) of C will be re- 
ferred to as discontinuous nonholonomic constraints along N . Outside of T^AI , 
C is a "good" smooth submanifold of T*AI, whose boundary is an immersed 
submanifold of T^M. 

The previous discussion becomes particularly simple when N divides M into 
two parts, Af+ and M_, as mentioned above. In such a case, T^M also divides 
T*AI into two parts, T*AI+ and T*M_, whose common boundary is T^M. 
Then, a discontinuous Hamiltonian H along N may be naturally seen as a pair 
of Hamiltonians, say, H^ and defined on T*M+ and r*M_, respectively. 
Similarly, a set of discontinuous nonholonomic constraints along N is regarded 
as a pair of sets of nonholonomic constraints C± C T*AI±. Since N always 
divides AI locally, this description constitutes a local picture of a discontinuous 
nonholonomic Hamiltonian system along TV. 

We will continue to use the notation C™^^ (resp., S'*'') for instantaneous non- 
holonomic constraints (resp., the trace of S) in the case of discontinuous non- 
holonomic systems. As before, one may interpret C""*^* as a 2-valued system of 
instantaneous nonholonomic constraints along N . In the case when N divides 
M into two parts, we will distinguish between the two branches using the nota- 
tion C^t^*, and write also -'^(/f.c^f^'.Af) (resp., AT*]^ ^^). 

Remark 2. The impulsive behavior of a Hamiltonian system is not necessarily 
related to some discontinuity. This type of phenomena occurs, for instance, each 
time that one of its trajectories "strikes" against the boundary dM of the config- 
uration space AI . Various kinds of collisions, impacts, etc, in mechanical systems 
are described in this way. Otherwise said, impulsive behavior is characteristic of 
Hamiltonian systems with boundary. Moreover, systems with boundary may be 
viewed as a "limit" case of discontinuous systems by dropping the requirement 
N n i9M = and choosing N = dM, AI^ = 0, AI+ = AI. This allows a unified 
approach to both situations. 

4. The Transition Principle 

In this section we discuss the formulation of the Transition Principle for sys- 
tems subject to nonholonomic constraints. We first introduce the notions of 
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focusing points, constrained characteristics, and in, out and decisive points. The 
Transition Principle builds on these elements to prescribe the behavior of the 
Hamiltonian system when one or more of its ingredients undergoes a drastic 
change. 

4-.1. Focusing points. The following simple linear result will be key for the sub- 
sequent discussion. 

Lemma 1. Let y G T* M , and let W be an affine subspace in Ty{T*M) such that 
Qy is nondegenerate on W (hence, dim W — 21 for certain I ) and dim dyTT(W) — 
I. Denote by W° C Ty{T* M) and dy-K{wf = dyTi(W^) C r^(y)M the linear 
subspaces associated with the affine spaces W and dyTr{W), respectively. Then 
the affine subspaces W = y + Ann{dyTT{W)°) and W, ^ y + W° H Ty{T*i^y^M) 
in T*i^y^M passing through y are transversal. 

Proof. Since, by hypothesis, dimdy7r(W^) = I, one has 

dim n Ty{T*f^y^M) = I and dimdy'K{W)° = I . 

Now, the dimension of Ann{dyTT{W)°) C T*f^y-^M is n - I. Moreover, n 
Ty{T*^y^M) is transversal to Ann{dyTr(W)°) if one identifies the spaces T*^^y-^M 
and Ty{T*^y^M). The resuh now follows. □ 

Consider now a nonholonomic Hamiltonian system {M,H,C). Let y d C. 
Denote by Zi = Zij-jj be the affine distribution of the corresponding partial 
symplectic structure 0{H,C) (cf. Section By Definition |5l the affine sub- 
space W = Ay satisfies the assumptions of LemmaHon W (observe that dyTT{W) 
is precisely Uy in equation Q). Therefore, the subspace W — A* is well-defined 
and we put 

Ky = Ky{H,C)^A;cT:^y^M. 

Moreover, it is not difficult to see that the subspace W, — (Ay), is identical to 
TyC^^i^yY This shows that Ky is transversal to C at y, and that divciKy — m. 
The crown of the nonholonomic Hamiltonian system (M, H, C) is the map 

K^KH^C-C ^AraiT*M), y ^ Ky , 

where Ak(T*M) denotes the manifold whose elements are /c-dimensional afhne 
submanifolds contained in the fibers of the cotangent bundle T*M . One can see 
that the graph of the crown k, 

Graph(K) = {{y, u) e C x T*M \ v £ Ky} 

is a 271-dimensional smooth submanifold of C x T*M . Note that Graph(K) is a 
fiber bundle over C with projection 

P = P(H,C) ■ Graph(K) — > C , {y,v) i-^y. 

The fiber over y of this bundle is precisely Ky. Since y € Ky, the map 

cr : C — > Graph(K) , y ^ {y,y) , 
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is a section of P{h,c)- Since the fibers of P(h.c) ^-re afEne spaces, the bundle 
Graph(K) — > C has a natural vector bundle structure whose zero section is a. 
Moreover, this vector bundle is canonically isomorphic to the normal bundle of 
C in T*M. This is due to the fact that, for any y € C, the fiber Ky is transversal 
to C at y. The same argument also guarantees that the map 

^ = -{H.,c) ■ Graph(K) — > T*M , {y,v) i-^v, 

induces a diffeomorphism of a neighborhood of the "zero" section cr(C) in Graph(K) 
onto its image. 

Definition 9. Let (M, H, C) he a nonholonomic Hamiltonian system. Given a 
point u £ T* M , its {H, C)-focusing locus F(^jj c){u) is the set of all points y G C 
such that u £ Ky . In other words, 

F(H,C)[u) ^P(H,C) (■S'(^^,c)H) ^ '^^M ■ 
A point in F(^j^(jj(u) is called focusing for u. 

Standard arguments show that S'^^ p) is regular, i.e., of maximal rank 2n 
almost everywhere, that is, with the exception of a closed subset without interior 
points. Therefore, for a generic point u e T*M, the subset S'^j^ ^^{u) is discrete, 

and so is i^(//.c)(u) as well. Note also that if u e C, then u E F(^h,c){u)- 

Remark 3. Focusing points can be understood as nonintegrable analogs of the 
notion of reducing points considered in in connection with the Transition 
Principle for inelastic collisions. 

Remark It is worth noticing that the concept of a focusing point makes also 
sense in the absence of constraints. Obviously, in this case F(^h,c){u) — {"}• 
Therefore there is no need to distinguish between the constrained and non- 
constrained cases in the statement of the Transition Principle. 

If the constraints are linear, i.e., C = T + C° with C" a linear codistribution 
and T G A^{M) (the displacement form), then for each y G T*M, 

T;(^)Af = C°(^)®Ann(d,^(zi,)), 

where A = zi(^ Denote the corresponding projectors by V : T*^y^M ^^(y) 
and Q : T*^y^M Ann{dyTT{Ay)). Given u G T*M, one has that z G F(^h.c){u) 
if and only if z G C and r{u). Since z = V{z) + Q{z) ^ r{u) + Q{Ty), 

one has the following result. 

Proposition 3. Let {M, H, C) he a nonholonomic Hamiltonian system with lin- 
ear constraints. Then, for u G T*M , there is a unique focusing point given by 
F(H,c){u) = {V{u) + Q{Ty)}. 
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^.2. Instantaneous focusing points. We will also need an instantaneous version 
of the notion of a focusing point introduced in the previous section. For this 
purpose, it is sufficient to apply the above construction to instantaneous con- 
straints instead of to the "usual" ones. Namely, let C™^*" be a system of regular 
instantaneous constraints along N (see Section I^^SJ and = Z\giinEit be the 
corresponding afhne distribution (see Section|^21). Following the same reasoning 
as above, the afhnc subspace W — A^y^^ C Ty{T*M) satisfies the assumptions 
of LemmaE Therefore, the affine subspace K^J'''^ = (Z\'""')' ofTy{T*M) is well- 
defined, and we have all the ingredients to define the notion of instantaneous 
crown and instantaneous focusing point of a system subject to instantaneous 
nonholonomic constraints. For completeness, we state the latter. 

Definition 10. Let {M,H,C) be a nonholonomic Hamiltonian system and let 
C™"* be a set of instantaneous constraints along a hypersurface N C M. Given 
a point u S T^M, its (H, C™**, A^) -instantaneous focusing locus F(^h,C"^'KN){u) 
is the set of all points y G C""** such that u S if™**. In other words, 

F{H,C'"'t,N){u) = P(H.,C'"--') (-(ff,C™",JV)(")) ^ '^T{u) ■ 

A point in Fi^fj Q,n.,t j^'^{u) is called instantaneous focusing for u. 

As before, if the instantaneous nonholonomic constraints are linear C*™*** = 
2^inst _^ (;;inst°^ ^Yien for each y E T'^M, 

T:^y)M = C7°(^) ® Ann(d,^(Z\^-*)) , 

where = A^fj.c^'^'^,N)- Denoting the corresponding projectors by p'"*** : 

T*^y)M C'""'°(y) and Q'""* : T*^y)M kiTa{dyTi{A^''^)), one has the follow- 
ing result. 

Proposition 4. Let (M, iJ, C) he a nonholonomic Hamiltonian system and let 
C"'"* he a set of instantaneous affine constraints along a hypersurface N C M . 
Then, for u £ T* M , there is a unique instantaneous focusing point given by 
F^H,C"^",N)iu) = {P™^*(7/) + Q»^*(r™^t)}. 

4-3. Constrained characteristics. Consider then a partial symplectic structure 
— {A,u!) on a manifold C which is transversal to a hypersurface B C C (cf. 
Definitional). Let Z\° denote the linear distribution associated with A. For each 
y G B, consider the Hnear space V = Ay, the hyperplane W — Ay CiTyB of 
V and the nondegenerate skew-symmetric form b — ujy\^o. The characteristic 
direction at y G B is defined as 

ly = ly{e,B) = ker6|vF = ker(cjj,UonT,B) C AlnTyB. 

The proof of the following result is straightforward. 

Lemma 2. Given a partial symplectic structure O — {A,lo) on a manifold C 
and a hypersurface B C C transversal to it, the distribution y i— » ly{0,B) is 
one- dimensional. 
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Definition 11. Given a partial symplectic structure = {A,lu) on C and a 
hypersurface B <Z C transversal to it, y ^ ly(0,B) is called the characteristic 
distribution with respect to (6*, B), and its integral curves, denoted by are the 
{0, i3)-characteristics. 

We are particularly interested in the case when we have a nonholonomic 
Hamiltonian system S = (M, H, C), the partial symplectic structure is 0h,c, 
iV is a hypersurface in M and i? = T^M, B = T^jM n C. We will use 
the terminology {S\N)- or {H,C\N)- characteristic as a substitute for {0,B)- 
characteristic. It should be emphasized that {H, C|A^)-characteristics are only 
defined when S is transversal to N (see Section IXT|l . 

In the absence of constraints, i.e., when (C = T*M,uj = i?) is a symplectic 
manifold, and A is the trivial distribution y i— > TyC on C, the characteristic 
curves are precisely the characteristics introduced in |^. We will refer to non- 
constrained characteristics and constrained characteristics when it is necessary 
to distinguish between the unconstrained and the constrained cases. 

Remark 5. Just as non-constrained characteristics play a key role in describing 
holonomic elastic collisions, and reflection and refraction phenomena of rays of 
light , the constrained characteristics will be fundamental in describing the 
"elastic part" of nonholonomic impulsive phenomena. What is meant by "elastic 
part" will become clear in Section IT!5l when describing decisive points. 

If the constraints C are affine, then the {H, C|7V)-characteristic passing through 
a point y £ C , 7r(y) e iV, is described in a particularly simple way. Namely, fol- 
lowing Proposition 121 it is not difficult to see that the C| A'')-characteristic 
passing through y is given by 

= y + Cl^y) n Ann(d,aH(C°) n T,(y)N) , 

with being the linear codistribution associated to C . In particular, in the 
absence of constraints, C = T*M and the characteristics are straight lines in 
T*M parallel to Ann(r^iV), x eN. 



4-4- I'n, out and trapping points. Here, we first introduce some concepts con- 
cerning the behavior of a vector field in a neighborhood of the boundary of its 
supporting manifold. We then discuss the notions in, out and trapping points. 

Let Q be a manifold with boundary and X a vector field on Q. A point 
y e dQ is called a jth order in point for X if there exists a trajectory of X, 
/3 : [0, a] — > Q, a > such that 

y = /3(0) and f3{t) i dQ , for < i < a , 

and /3 is jth order tangent to dQ at y. A jth order out point for X is a jth 
order in point for —X. In the dynamical context we have in mind, in and out 
points of 0th order are the most important. It is easy to see that y £ dQ is a 
0th order in point (resp., out point) for X if the vector Xy is transversal to dQ 
and directed inside (resp., outside) of Q. A point that lies on a trajectory of X 
which is entirely contained in dQ is called a trapping point for X . 
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Let dQ^ = dQ^{X) denote the subset of all points of dQ where X is jth order 
tangent to dQ, and dQi, = dQy{X) (resp., 9(5< — dQ-'^{X)) the set of all jth 
order in points (resp., out points) for X. Note that dQ^ D dQ^^^ and 

dQ' \ {dQi U 9Q^< ) c ag^+i . (6) 

In a generic situation, dQ^ is a submanifold (with singularities) of codimension j 
in dQ, which is divided by dQ^^^ into two parts, 9Q> \ dQ''y'^ and dQ^^ \ dQi^^. 
An analytical description of the previous discussion is obtained by choosing a 
smooth function f on Q with / > and dzf ^ 0, for all z e dQ such that 
(^Q = {/ = 0} (which always exists locally). Then 

dQ' = {z e g I !{z) = 0, X(/)(z) - 0, ... , X\!){z) = 0} , 

5Qi \ - dQ' n {z e Q I X'+^{f)[z) > 0} , 
dQ'< \ dQ'+' = dQ' n {z G Q I X^+^(f){z) < 0} . 

The vector field X is said to be regular with respect to dQ when the inclusion 
in equation ^ is an equality for all j > 0. This is a generic property of vector 
fields. In such a case, the chain of inclusions 

dQ^dQ°Z)dQ^Z) - --DdQ' Z)---D dQ'' 

is a stratification of dQ whose strata are dQ'^ \ dQ'^^ and dQ'^ \ dQ'^^ . Note 
also that the set of trapping points precisely corresponds to dQ"'. 

Consider now a discontinuous nonholonomic Hamiltonian system (M, H, C \ 
N) and the corresponding cutting-up (M, <j). Then we can resort to the previous 
discussion with the manifold Q — C C T* M and the vector field X — Xfj ^. 
Recall that N C dM and dC ^ C f^ T*^jM. 

Definition 12. Let S — {M,H,C \ N) be a discontinuous nonholonomic system 
and denote by (M, <;) the associated cutting-up. A point y G T^M is called an 
in point (resp., an out pointj of S if there exists z £ Tj^AI such that y — <j(z) 
and z is an in point (resp., an out point) of Xj^ ^ with respect to dC . 

By definition, the map T*(, restricted to dC is an immersion. A point in T^M 
may turn out to be an in and an out point at the same time. To resolve this 
ambiguity, the branch of T*^ to which such a point belongs must be taken into 
consideration. This distinction is easily described in the case when TV divides M 
into two parts. In fact, in this case the system (M, H, C \ N) may be viewed as a 
couple of nonholonomic Hamiltonian systems {M±, H±, C'±), with the common 
boundary dM± = N, and where H± £ C°°{M±) and C± C M± (cf. SectionESJ. 
An in (resp., out, or trapping) point of the vector field Xh^.c+ with respect to 
the boundary 9C+ is called an plus-in (resp., plus-out, or plus-trapping) point. 
Analogous definitions are established for e = — . In this way, the notions of plus- 
in point, minus-in point, etc, introduced in [Hj for the unconstrained situation are 
generalized to the constrained case. Finally, we observe that N always divides 
M locally, and therefore the previous discussion is always valid locally. 
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4-.5. Decisive points. At this point, we are ready to introduce the key notion of 
decisive point corresponding to an out point. The construction of decisive points 
depends on two elements: first, the mode (elastic or inelastic) in which the system 
passes through the critical state and, second, the continuity and differentiability 
properties of the Hamiltonian. Below, we will limit our discussion to the two 
most relevant situations, just to avoid not very instructive technicalities arising 
in the most general context. The first one is the case when the Hamiltonian is 
smooth and only the constraints are discontinuous along the critical hypersur- 
face. The second one concerns discontinuous Hamiltonians and not necessarily 
discontinuous constraints, ft is worth stressing that the first situation can not 
be considered as a particular case of the second one, i.e., that the notion of a 
decisive point is not "continuous" in this sense. In what follows, e G {+,— } 
and e stands for the opposite sign to e. Throughout the section, instantaneous 
constraints are assumed to be regular. 

4-.5.1. Elastic mode: change of constraints. Here, we deal with a discontinuous 
nonholonomic system {M, H,C\N), where the Hamiltonian function is smooth, 
H e C°°{M). 

Definition 13 (Decisive points for smooth Hamiltonians and discontin- 
uous constraints). Let (M, H,C\N) be a regular discontinuous nonholonomic 
system, with H G C°° (M) and consider a set of instantaneous constraints C^""* 
along N . Let y he an e-out point of the system. A sequence (jji, Si), i = 0,1, ... ,k, 
with y.i £ C r\ T^M is called (y, e) -admissible if it verifies the following condi- 
tions: 

("jj (2/o,£o) = (2/,e); 

(ii) for all i < k, yi+i is a focusing point for yi with respect to either C™^* or, 

if instantaneous constraints are absent, C^^^^ ; 
(Hi) yi is an Si-out point for all i < k and y^ is either an Sk-in point or an 

Ek-trapping point; 
(iv) the sequence of signs {si} alternates, i.e., Ei+i = Si. 

The end point of an {y,e)- admissible sequence, {yk,£k), is called (y, e) -decisive 
and the constrained Hamiltonian vector field XH,Ce^ referred to as the vector 
field corresponding to it. 

Remark 6. The above formal description of decisive points is equivalent to the 
following iterative procedure. Take, for instance, a plus-out point y. Then, ac- 
cording to Definitional all focusing with respect to C™^^ (resp., to C^) minus-in 
and minus-trapping points are decisive. On the other hand, the procedure con- 
tinues by restarting from any of the remaining focusing points that are minus-out 
points, and so on. In some situations, this process may turn out to be infinite. 
At the present time, however, it is not clear whether that kind of phenomena 
can occur, say in propagation of singularities or similar processes. 

4.5.2. Elastic mode: discontinuous Hamiltonians. In this case, decisive points 
are constructed on the basis of an iterative procedure whose single steps are 
either of reflective or of refractive type, as described below. Consider a regular 
discontinuous nonholonomic system S = (M, H,C\N), which might be subject 
to additional instantaneous constraints C™^*" along N. Let y G C fl T^M be an 
e-out point. 
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Reflective step. 

1- st move: y ^ z, where z is a point in the constrained characteristic C,y{H^, C^) 

such that Hf;{z) = Hs{y). 

2- nd move: z m, where u is a focusing point for z with respect to either C"'''' 

or, if e-instantaneous constraints are absent, C^. 

Refractive step. 

1- st move: y ^ z, where z is a point of the constrained characteristic Qy{Hg^ C^) 

and such that Hg{z) = H^{y). 

2- nd move: z u, where u is a focusing point for z with respect to either C^"*'* 

or, if e-instantaneous constraints are absent, Cg. 

With a shght abuse of language, we shall say that {y,e) is the initial point 
of the step and (m, e) (resp., (u, e)) is the end point of the step if the scenario is 
reflective (resp., refractive). 

Definition 14 (Decisive points for discontinuous Hamiltonians). Con- 
sider a regular discontinuous nonholonomic system {M,H,C\N). Let C'™''* be a 
set of instantaneous constraints along N . Let y be an e-out point. A sequence 
(yi^Ei), i = 0, 1, . . . , fc, is called (y, e)-admissible if 

(i) {yo,£o) = {y,£); 

(ii) (yi^Si) and (j/i+i, Si+i) are the initial and the end points of a step, respec- 
tively; 

(Hi) yi is an Si-out point, < i < k, and yk is an Ek-in point or an Sk-trapping 
point. 

The end point {yk,£k) of an admissible sequence is called {y,e)- decisive and the 
constrained Hamiltonian vector field Xh,c^^ *s referred to as the vector field 
corresponding to it. 

If the Hamiltonian is discontinuous and the constraints are linear, i.e., C C 
T*M is a smooth linear submanifold, and the instantaneous constraints are 
absent, the previous definition of decisive points becomes much simpler, as the 
following result shows. 

Proposition 5. Let (Af , H, C\N) be a regular discontinuous nonholonomic Hamil- 
tonian system with smooth linear constraints. Let y be an e-out point. The 
(y, e) -decisive points are the in and the trapping points belonging to the in- 
tersection of the constrained characteristic Cy passing through y with the set 
{z€C\H±{z)=H,{y)}. 

Proof. Let y be an e-out point and denote by {zi, . . . , Zg} (resp, {zi, . . . , Zs}) 
the points belonging to the intersection of the constrained characteristic <^y 
passing through y with the set {z G C | H^{z) = H^{y)} (resp. with {z G 
C I Hg{z) — He{y)}). Since the constraints are smooth, then u = z in the 2nd- 
move of both a reflective and a refractive step. Now, for any j G {1, . . . , s}, 
the intersection of the constrained characteristic passing . through Zj with 
the set {z e C \ H^iz) = H^{zj)} (resp. with {z G C | i4(z) = i?e(y)}) is 
again {zi, . . . , z^} (resp, {zi, . . . , z^}). The same observation holds for any z^, 
i G {1, . . . , s}. The result now follows from Definition IT^ □ 



20 



Jorge Cortes, Alexandre M. Vinogradov 



Remark 7. The introduced terminology remains valid for nonholonomic systems 
with boundary (cf. Remark [21). In such a case, one has to formally put 

A/_ = , Af+ = M , N = dM , = oo , H+ = H . 

This type of geometric data occurs in describing various collision phenomena. 

4-5.3. Inelastic mode: change of constraints. As in the elastic case, we first deal 
with the case when the Hamiltonian H is smooth. We treat an inelastic behavior 
of the system as the passage under the control of either the instantaneous dis- 
continuous nonholonomic system or, if instantaneous constraints are absent, the 
discontinuous boundary system. In this and subsequent sections, the following 
shorthand notation will be used (cf. Sections 13.21 and 



We also use this notation when the Hamiltonian H is smooth, i.e., H± = H. 

Definition 15 (Decisive points for smooth Hamiltonians and discon- 
tinuous constraints). Consider a regular discontinuous nonholonomic system 
(M, H,C\N). Let (7™''* he a set of instantaneous constraints along N. Let y he 
an e-out point. An (y, e) -decisive point is a focusing point for y with respect 
to either (7™**'*'' qj-^ ij ^/jg instantaneous constraints are absent, Cf' . The con- 
strained Hamiltonian vector field X^"'''*'*^ , respectively, Xf is referred to as the 
corresponding vector field. 

4.5.4. Inelastic mode: discontinuous Hamiltonians. As in the elastic case, deci- 
sive points are constructed on the basis of reflective or refractive steps, as we 
now describe. 

Reflected falling step. 

1- st move: y ^ z, where z is a point of the constrained characteristic C.y{Hg, C^) 

such that Hi;{z) = Hs{y). 

2- nd move: z w, where w is a focusing point for z with respect to either d^^^'^^ 

or, if e-instantaneous constraints are absent, Cf . 

Refracted falling step. 

1- st move: y ^ z, where z is a point of the constrained characteristic Qy{Hg, C^) 

such that Hg{z) = H^{y). 

2- nd move: z w, where w is a focusing point for z with respect to (7™''*'*'' or, 

if e-instantaneous constraints are absent, Cf . 

We shall refer to (u,£) (resp., {u,e)) as a reflected (resp. refracted) falling 
point. 

Definition 16 (Decisive points for discontinuous Hamiltonians). Con- 
sider a regular discontinuous nonholonomic system {M, H,C\N). Let (7™^' he 
a set of instantaneous constraints along N. Let y he an e-out point. An {y,e)- 
decisive point is a falling point for y. The vector field x™**'*'' (resp., Xf if 
(jvnst — 0j jg called the vector field corresponding to a reflected falling point. 
The vector field Xf^''^'^^ (resp., Xf if Cf^''*' = %) is called the vector field corre- 
sponding to a refracted falling point. 



finst,tr 



Cf ^ c, 
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4-.6. Transition Principle. From a physical point of view, the Transition Prin- 
ciple formulated below is an explicit description of the discontinuity of a tra- 
jectory of a regular nonholonomic Hamiltonian system 5* that occurs when it 
traverses a critical state. Such a discontinuity is interpreted as an impact, col- 
lision, reflection, refraction, etc, depending on the physical situation modeled 
by the system 5*. From a mathematical point of view, the Transition Princi- 
ple corresponds to the definition of the trajectory of a regular discontinuous 
nonholonomic Hamiltonian system. 

The elastic or inelastic character of the impulsive motions of an specific phys- 
ical system must be taken into account when defining the trajectories. Accord- 
ingly, there are two different versions of the Transition Principle that distinguish 
between the two situations. Let S = (M, H, C\N) stand for a regular discontin- 
uous nonholonomic system and let (7'"''* be eventual instantaneous constraints 
imposed on S along N. Let (M, <;) be the associated cutting-up of AI along N 
(cf. Section rO|l . The regular part of a trajectory of the system S = (M, H, C) is 
the part of the trajectory of the Hamiltonian vector field Xj^ ^ that lies outside 

dM. The regular part of a trajectory of S is the image by ^ of the regular part 
of the corresponding trajectory of S. At least locally, the regular part may be 
viewed as a piece of the trajectory of the vector field Xn^fi^ that lies outside 
the hypersurface T^AI. 

Transition Principle. Let S — (M, H,C\N) be a regular discontinuous non- 
holonomic system and let C*™*'' be eventual instantaneous constraints on S along 
N . If a regular trajectory of the vector field Xh^ fi^, e = ± reaches the critical hy- 
persurface TjfM at a point y, it then continues its motion from any (y, e)-decisive 
point according to the chosen mode, elastic or inelastic, under the control of the 
corresponding constrained Hamiltonian vector field. 

Some features of the Transition Principle are worth mentioning. First of all, 
it prescribes a splitting of the trajectory when the number of decisive points is 
greater than one. Of course, it is difficult to imagine that a true mechanical 
system "goes into pieces" when reaching the critical hypersurface. But it may 
perfectly happen when a Hamiltonian system describes the propagation of sin- 
gularities in a fields or a continuum media. A classical example one finds in 
geometrical optics when a light ray passing from one optic medium to another 
splits into reflected and refracted rays (see, for instance, (31)1. The trajectory 
may also be trapped by the critical hypersurface. This happens when an "impact" 
state y possesses no y-decisive points. 

5. Mechanical systems 

In this section, we particularize the previous discussion to mechanical systems 
subject to affine constraints. Let t; be a Riemannian metric on M and V € 
C°°{M), and consider the mechanical system whose kinetic energy and poten- 
tial function are T{q,v) — ^g{v,v) and V, respectively. The corresponding La- 
grangian function is L{q,v) — T{q,v) — V{q) and the Hamiltonian one is 

Hiq,p)^fiq,p) + V{q), (7) 

where T{q,p) = ^G{p,p), and G is the co-metric, i.e., the metric on the cotangent 
bundle induced by g. In a local chart on M, the local expressions of g and G 
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are 

g = gabdq'' ® dq\ Q = g'^^^ — ® . 

In the mechanical case, the Legendre transform Cl ■ TM — > T*M is a hnear 
bundle mapping whose local description is CL{q'^,q°') — {q'^,gabq'')- 

Consider an afhne distribution C — C'^ + Y in T*M determining some non- 
holonomic constraints on the system (M, H). The linearity of an = implies 
that the space a}i(G) — aaiC^) + OLuiY) is a distribution of afHne spaces on 
M, or otherwise said, that aniC^) is a linear distribution on M. Throughout 
this section, we will often resort to the shorthand notation V = ^^(C'^) and 
T — aniY). Now, it is easy to verify that T*M = C° Ann(X')q, with associ- 
ated projectors 

Vg : T;M , : T;M Ann(I?), , qeM. 

Let 111 — iiiadq"^ , . . . , fim — fJ-madq"^ be 1-forms such that (locally) Ann(P) = 
span{/ii, . . . , fim}- Define the local function fiiQ : M ^ M by fJ.iQ{q) = —iii{T[q)). 
Then auiC) is locally defined by the equations 

Mial?)?" + Aiio(9) = 0, l<i<m. 

Now, consider the matrices 

G^igab), J^(p^a), B = JG'^ j' . (8) 

From the discussion after Proposition^ recall that {Ah,Ci ^c) is a partial sym- 
plectic structure if and only if anlc is an immersion, or, equivalently, if the 
compatibility condition is verified. Following 11 , the latter is equivalent to the 
matrix B being invertible. A direct computation give the following local expres- 
sion for the projectors V and Q, 

r{x) X - Q{x) , Q{x) = .PB-^JG-^x, x€T*M. 

Finally, let C M be a hypersurface and assume that the nonholonomic sys- 
tem S = {M, H, C) is transversal to N . Consider also a set of instantaneous 
nonholonomic linear constraints C""®' = (C™^^ ,Tcinat) imposed on S along A^. 
Note that T*M = C^"''*° © Ann(P'""'),, with associated projectors 

T';""' : T*M C'^""^" , Q^""^ : T*M — > Ann(P'""')g , q&N . 



5.1. Focusing points. Since the mechanical system is subject to an afhne dis- 
tribution of constraints. Proposition |31 implies that for a given u € T*M, the 
focusing locus is F(^h.c){i^) — {^('") + 2(^)}- Regarding the instantaneous focus- 
ing points, according to Proposition^ one has that -F'(H,c'"=SAf)(w) = {'P™^^{u) + 
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5.2. Constrained characteristics. Here we give an explicit description of the char- 
acteristic curves. Let N be the critical hypersurface, and assume that (locally) 
N = f-\Q), with / e C°°(M) verifying that dgf ^ for aU q e N. Consider 
the covector field V{df) along N defined as g i-^ 'P{df)q = Pq{dqf), qeN. The 
transversality assumption between C and N implies that V{df)q ^ 0, for all 
q^N. Clearly G C". In addition, for u e P n TN, 

V{df){v) = (df - Q{df)){v) = df{v) =0, 

and one can conclude that C° fl Ann(2? n TN) = spanlV (df)} . Therefore, we 
have the following result. 

Lemma 3. The constrained characteristic of a mechanical system (Af, H, C\N) 
passing through y G CHT^M is given by Cy — J/ + span{7'(c?^(j,)/)} C COT^M . 

Note that in the absence of constraints one recovers the standard non-constrained 
characteristic (^y — y + span{o?^(j,)/} passing through y. 

5.3. Decisive points: elastic mode. 

5.3.1. Change of constraints. Let C± <Z T* M he two affine constraint subman- 
ifolds. Denote by V± and Q± the projectors corresponding to C± and the co- 
metric Q. Let ?/ G Ce n T^M be a e-out point, e G {+, — }. Then, according 
to Definition 1131 an y-admissible sequence, (j/i, £i), i — . . . ,k, is necessarily 
of the form yj+i = Vet+iiyi) + Qei+A'^c^^-^)- If instantaneous constraints are 
present, then one has to use the projectors p™^^ and Q™^^ instead of and Qe, 
respectively. 

Remark 8. Mechanical systems subject to generalized constraints are also treated 
in |14j in a somehow different context. The approach taken there makes use of 
generalized (i.e. non-constant rank) codistributions defining the nonholonomic 
constraints and a generalized version of Newton's second law. Under appropriate 
regularity conditions, it can be seen that the 'post-impact' point in |14| is a 
decisive point of the Hamiltonian system according to Definition II HI 

5.3.2. Discontinuous Hamiltonian systems. Let C± C T*M be two affine con- 
straint submanifolds. Let g± be a Riemannian metric on M± and V± G C°°{M±) 
such that 

H±iq,p)=f±iq,p) + V±iq), f±iq,p) ^ ^g±{p,p) . (9) 

For simphcity, we only treat the case V± — V\m±, V G C°°(M). We denote 
by 'P± and Q± the projectors corresponding to C± and the co- metric Q±. Ad- 
ditionally, let C^^^^ C T^M be affine constraint submanifolds corresponding to 
some instantaneous constraints imposed along N. Denote by 'P^_^^^ and Q'j?''* the 
projectors corresponding to C™^* and the co- metric Q±. 

Let y G CeCiT^M be an e-out point. Following Definition II 41 we first describe 
the reflective and refractive steps with initial point {y, e). According to LemmaOl 
we have to look for points of the form 

x^y + cVeidqf), q = TT{y), 

for some c, which in addition belong to the same iJ-energy level as y. 
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Reflective step. Concerning the 1-st move, note that y + cVeidqf) and y belong 
to T*M. Then, the equahty H,{y + cVe{dqf)) = H,{y) imphes that f^{y + 
cPM))=fM- Now, 



T,[y + cPM)) = 



tiy) + cg,{y,VM)) + -g,{P,{dJ),Ve{dJ)) , 



and, therefore, we have 

C {ge{y,Ve{dJ)) + ^ge{Ve{dJ),Ve{dJ))) =0, 

with sohitions 

An important property of these points is contained in the foUowing lemma. 
Lemma 4. Let y d H T^AI and c^^i he the constant given by (|10|l . Then, 

g,{y, dj) = g.ivM + Qe(T,), dj) , 

gdy + Ce,2Pe{dgf), dj) - g,{^Ve{y) + Qe(r,), dj) . 

Proof. The first statement follows by noting that if y G Cq, then y = Ve{y) + 
Qe{Tq). For the second one, notice that 

g,{y + C,,2V,{dJ), dgf) = ge{y, dgf) + C,,2ge{Ve{dgf), Veidgf)) 

= ge{y,dgf ~ 2V,{dgf)) = -g,{vM,'Pe{dgf)) + ge{Qe{y),dgf) 

= ge{-vM + Qe{rg),dgf), 

which gives the desired result. □ 

The 2-nd move simply consists of determining the focusing points for points 
with respect to C"^^^ or, if e-instantaneous constraints are absent, with respect 
to Ce. This is done in terms of the corresponding projectors, exactly as explained 
in Section above. 



Refractive step. Concerning the 1-st move, the equality Hg{y + cVsidqf)) — 
H-M implies f,{y + cV,{dqf)) = f,{y). Now, 

f,{y + cVeidqf)) = TM + cg,{y,re{dqf)) + ^ge{Ve{dqf).V,{dqf)) . 
Therefore, one has 

c{g,iy,P,idqf)) + ^g,iPM),P,idqf))) +TV(y)-f,(2/) =0, 
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with solutions i ~ 1,2, 

^Jg,{v,vM)Y - 2g,{vM),vM)){f,{y)-Te{y))) ■ (n) 

As before, the 2-nd move simply consists of computing the focusing points 
for the solutions 1)11(1 with regards to C"'*'' or, if e- instantaneous constraints are 
absent, C^. This is done in terms of the corresponding projectors according to 
Section [Ql 

Discontinuous Hamiltonian systems with smooth constraints. In this situation, 
there is a single constraint submanifold C, and a discontinuous Hamiltonian H± 
on T*M . Denote by 'P± and Q± the projectors corresponding to C and the co- 
metrics Q± , respectively. According to Proposition |S1 the decisive points for a 
given e-out point y € C OT^M are simply the in and trapping points belonging 
to the intersection of the constrained characteristic (y passing through y with 
the set {z G C \ H±{z) — H^{y)}. Therefore, as candidate e-decisive points 
we have the solution corresponding to Ce^2 in PUII - and as candidate e-decisive 
points we have the solutions corresponding to Cg,i, i — 1,2, in (|11() . 

Proposition 6. Lety e CeCiT^M be a e-out point. If the constraints are linear, 
C — C*', then the solution corresponding to 0^.2 iri ((l()|l is a e-decisive point for 

y- 

Proof. The basic observation is the second order character of the dynamics, both 
in the presence and in the absence of nonholonomic constraints. This implies 
that for any y G T* M and any distribution of affine constraints C, we have 
XH{f){y) = XH,c{f ){y): since / is only a function of the configurations. Note 
that if H is of mechanical type, then XH{f){x) = Q{x, dqf), for any x G T*M. 
Now, from Lemma 0] taking H — H^, one gets 

ge{y + c2Pe{dj),dj) = -g,{y,d,f) . 

Since y is a e-out point, then G^iy, dqf) ^ 0. Consequently, Xh^ {f){y+C2'P+{dqf)) = 
—Q^{dqf, y) has the opposite sign, and hence it is an in point. □ 

5.4. Decisive points: inelastic mode. 

5.4.1. Change of constraints. Let C± C T* M be two affine constraint subman- 
ifolds and let C™^^ be a set of instantaneous affine constraints. We denote by 
'P^^^'^ and Q'j}^' the projectors corresponding to C^™*^ and the co-metric Q. If 
the instantaneous constraints are absent, denote by 'P± and Q± the projec- 
tors corresponding to Cj5 and the co-metric Q. Let y E C\ T^AI be a e- 
out point, e G {+,—}. Then, according to Definition 1151 the unique y-decisive 
point is V™^^{y) + Q™^^{Tc-) (or, if there are no instantaneous constraints. 
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5.4-2. Discontinuous Hamiltonian systems. As in Section 15.3.21 let C± C T*M 
be two affine constraint submanifolds, g± a Riemannian metric on M± and 
V± S C°°{M±) such that equation @ is verified. For simphcity, we only treat 
the case V± = V\m±, V e C°°(M). We denote by V± and Q± the projectors 
corresponding to C± and the co- metric Q±. Additionally, let C]^^^ C T^M be 
affine constraint submanifolds corresponding to some instantaneous constraints 
imposed along N. We denote by p^'i'^t.tr ^^^^ Qmst,tr ^-j^^ projectors associated 
with the submanifold C^'^^^'^'^ and the co-metric Qg. In the absence of instanta- 
neous constraints, we denote by 7^'"' and Qf the projectors associated with the 
submanifold Cf^ and the co- metric Ge- In case N = dM, we denote the latter 
with the superscript "9" instead of "tr" . 

Let y G n T^AI be an e-out point. The points associated with y resulting 
from the 1-st moves in a reflected or a refracted falling step are given, respec- 
tively, by equations (|10|l and As before, the 2-nd move simply consists 
of computing the focusing points for these solutions with respect to C^"*''^*'' for 
a reflected falling step (respectively, (7™*'*'" for a refracted falling step) or, if 
the instantaneous constraints are absent, C^'' (respectively, C|''). This is done 
in terms of the corresponding projectors according to Section l5. II According to 
Definition 1161 this gives all the i/-decisive points. 

Proposition 7. Let y € C'^ D T^M be an e-out point and assume that the con- 
straints are linear. For N = dM , the unique y-reflected falling point is given 
jjy pmst,d^y^ ^gj,^ aiiggjiQg of e -instantaneous nonholonomic constraints, 

Proof. From the previous discussion, we know that the points in the constrained 
characteristic passing through y with the same i/g-energy level are y itself and 
y + c^.,2'Pe{dqf), q = n{y) (cf. equation (fTUI) ). Now, note that dqf belongs to the 
-orthogonal complement of {T{dM)), i.e. 

GeidJ^P) = dJianAP)) - 0, /3 G aHliTidM)). 

Using the equality Cf = n a^^^(T(aAf)), we have that dj £ a]jl{T{dM))^- 
implies V'^''*-'^ {Ve{dqf)) = andVf (^^(d,/)) 0. The result is then a conse- 
quence of Definition [TBI □ 



5.5. Energy behavior. In this section, we discuss the consequences regarding the 
energy behavior of the system that result from the application of the Transition 
Principle. 

Lemma 5. Given y £ T^M , let x = V{y) + Q(Tq), q = 7r(y), be the associated 
y-focusing point with respect to a submanifold C C T* M . Then 

f{x)<f{y)+f{Q{Tq),Q{Tq)), 

and the equality holds if and only if y belongs to . 
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Proof. Note that 

Q{V{y) + Q{T,),V{y) + Q(T,)) = g{V{y),r{y)) + g{Q{T,), Q{T,))) 
< Q{r{y),V{y)) + g{Q{y), Q{y)) + g{Q{T,), Q{T,)) 

^g{y,y) + g{Q{T,),Q{T,)), 

where we have used that g is positive-definite, and the fact that and Ann(2?) 
are orthogonal spaces with respect to the co-metric g. If the equahty holds, then 
g{Q{y)^ Q{y)) = 0, which is equivalent to y G C°. □ 

As a consequence of this simple lemma we can conclude that in the case of 
linear constraints the Transition Principle always implies a loss of energy. This is 
a suitable generalization to constrained systems of the classical Carnot theorem 
for systems subject to impulsive forces |33| . 

Theorem 1 (Carnot's theorem for generaUzed linear constraints). Sup- 
pose that the Hamiltonian system is subject to nonholonomic constraints given 
by a linear distribution. Then the Transition Principle implies always a loss of 
energy as the result of an "impact". 

Proof. Under linear constraints, note that T = 0. From LemmaEl we get T{x) < 
T{y) with F(^fj,c){y) — {^}- The result now follows from the formulation of the 
Transition Principle and the definitions of decisive points in Section 14.51 (cf. 
Definitions ll3ll6|l . □ 

Under linear constraints, the trajectory of the system maintains the same 
energy level after the application of the Transition Principle in the following 
cases: 

(i) when the decisive points are determined according to Definitions 1131 and 1151 
and the impact point y e T'^M belongs to C+ fl C_; and 

(ii) when the constraints are smooth, and therefore the decisive points are deter- 
mined according to Proposition 

If the decisive points are determined according to Definitions El and El then 
nothing can be said in general. The refractive steps will typically imply an energy 
decrease. 

Remark 9. This type of energy arguments also allows to discard as follows the 
possibility of chattering when computing the y-decisive points if the constraints 
change (see Definition El and RemarkEl above). Let TV = {y e T*M \ f{y) = 0}. 
Assume there is an infinite y-admissible sequence {yi, e^), z = 0, . . . , oo. For each 
z, we have that ^ 2/i+i, since otherwise 

^kc.,+, {f){v^+l) = X'H{f){y^+l) = ^Vc.^ ifM) , all I , 

which together with the fact that yi is a Ei-out point, implies that yi+i is a 
ffi+i-in point. The latter contradicts the definition of admissible sequence. As a 
consequence of Lemma O we then have 

f{y„) > fiyi) > f{y2) >■■■> f{y,) > . 

The limit of this sequence is zero, which implies that the y-decisive point corre- 
sponding to such a sequence would be 0, that is, the trajectory would get 'stuck' 
when reaching N. 
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5. 6. Integrable constraints. The integrability of the constraints simpHfies the ap- 
phcation of the Transition Principle. Consider, for instance, the situation when 
the mechanical system is unconstrained on M„ and is subject to some gener- 
alized linear constraints C = C'^ on AI^ that turn out to be holonomic, i.e., 
C(h{C^) = T) is integrable. Denote by {Sa}, a being an m-dimensional parame- 
ter, the foliation of induced by I?. Locally this foliation is described by m 
functions fi G Coo [M] such that 



A similar situation has been treated in in the context of totally inelastic 
collisions (note, however, that in the integrable distribution is defined only 
on N, whereas here V is defined on M_|_). The integrable constraints imposed by 
V can be interpreted as an abrupt reduction of the phase space of the mechanical 
system. 

By definition, one has that Ann(I?) = spanjd/i, . . . , df,n}- The matrix J in ||SJ) 
is then given by J = {dfi/dq°') and the projector V is V{x) — {l — J*B~^JG~^)x. 
Let y G C- n T^M be the impact state of a trajectory q{t) coming from M_. 
From the discussion in Section 15.31 we obtain that the unique focusing point 
associated to y is x = (1 — J^B^^JG~^)y. The trajectory will continue its motion 
in M+, M_ or N depending on the in/out /trapping character of the focusing 
point X. If it evolves in M+ (more precisely, in Sa C M+ with a such that 
X ^ Sa), we call it the 'refraction' of the original trajectory. If it evolves in M_, 
we call it the 'reflection' of the original trajectory. 

6. Examples 

In this section we consider four examples to illustrate the theory exposed above. 
They all present the example of a rolling sphere considered in various constrained 
situations. The first one is taken from and is treated here in order to provide 
a further comparison with previous approaches. The second one combines the 
presence of smooth nonholonomic constraints with discontinuous Hamiltonians 
and instantaneous constraints acting on the system along a hypersurface. The 
third one consists of a ball rolling on a rotating surface whose angular velocity 
is suddenly changed to a different value, and this is modeled via a discontinuous 
affine distribution of constraints. Finally, the fourth one presents a two- wheeled 
system with a rod of variable length and illustrates the application of the Tran- 
sition Principle in both the elastic and the inelastic modes. 

6.1. A rolling sphere. Consider a homogeneous sphere rolling on a plane. Assume 
it has unit mass (m = 1) and let be its inertia about any axis. Let {x, y) denote 
the position of the center of the sphere and let ((/?, 0, ijj) denote the Eulerian 
angles. The configuration space is therefore Q = M.'^ x SO (3). Assume that the 
plane is smooth if a; < and absolutely rough if x > (see Figure ^| . On the 
smooth half-plane, the motion of the sphere is assumed free, that is, the sphere 
can slip. On the rough half-plane, the sphere should roll without slipping due to 
the constraints imposed by the roughness. We are interested in determining the 
eventual sudden changes in the trajectories of the sphere when it reaches the 
line separating the smooth and the rough half-planes. 
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Fig. 1. The rolling sphere on a 'special' surface. 

The kinetic energy of the sphere is 

T^^{x^+f + k\Lol+u;l + Lul)), (12) 

where uJx, and ojz are the angular velocities with respect to the inertial frame, 
given by 

ojx = cos ip + ip sin 9 sin ip , ujy = sin ip — (p sin 9 cos -0 , <jJz = ^ cos 9 + ip . 

The condition of rolling without sliding of the sphere when a: > implies that 
the point of contact of the sphere and the plane has zero velocity 

(j)^ = X — rujy = , (fP' = y + rujx = , 

where r is the radius of the sphere. 

Following the classical procedure |2HIi we introduce quasi-coordinates ^q^\ 'g^' 
and 'g'^' such that g^'= uJx, '9^'= ojy and 'g^'= w^. The latter merely have a sym- 
bolic meaning in the sense that in the present example, for instance, the partial 
derivative operators d jdtf should be interpreted as linear combinations of the 
partial derivatives with respect to Euler's angles. Also to the differential forms 
dq^ one should attach the appropriate meaning, i.e. they do not represent exact 
differentials but, instead, we should read them as dq^ = cos'i/'d^ + sin^sin^/'diyS, 
etc. 

The singular hypersurface N is defined by7V=={a; = 0}. In this case, the 
constraints are linear and the nonholonomic distribution (C) = 2? on Mj^ is 
given by 

{ d d d d d \ 
= span |r- + _,-._ + _, _| . 

Here we are dealing with a single distribution which constrains the motion 
on M+. 

In the following we compute the decisive points for this example. Let A G 
C_ V\T*M be a minus-out point. A direct computation shows that the expression 
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of the projector V : T*M —tCin local coordinates is 



V 










-rk^ 








r 









r'-'+k'^ 









1/ 



(13) 



Therefore, the single focusing point for A e T^M is given by a; = 'P(A) e 
Cnr^M. If we denote A = (a;o, ?/o, 9o> (Pa:)o, (P!/)o, {pi)o, (^2)0, (^3)0) and 
a; = ix,y,q^,q'^,q^,pj:,Py,pi,p2,P3), we get 



Py 



r'^{ Px)Q +r{p2)o 



+ fc^ 



Pi 

P2 
P3 



-rP{Py)o + fc^(pi)o 
+ P 

rk'^{Px)o + fc^(P2)o 



(P3)o 



fc2 



Note also that the focusing point with respect to C_ — T*M associated with a; 
is X itself. Therefore, if a; is a plus-out point, the only admissible sequence for A 
is {(A, — ), (x, +), (x, — )}. If X is either a plus-in or a plus-trapping point, then 
the only admissible sequence for A is {(A, — ), (a;, +)}. The set of plus-trapping 
points for the dynamics Xh,c+ is d{T*MY = {m e T*M | a; = , p^, = 0}. 
Consequently, the trajectory is refracted, i.e., the sphere follows its motion on 
M+ under the dynamics Xh,c+ (rolling without slipping) if p^ > 0. Otherwise 
(i.e., if Px < Q), the trajectory is reflected by the "roughness" and continues in 
M_ under the dynamics Xh starting from x. 



6.2. A rolling sphere hitting a wall. This is a classical example I 12II17II28| that we 
treat here for the sake of completeness. Consider again a homogeneous sphere 
of radius r and unit mass. Assume that the sphere rolls without sliding on a 
horizontal table, and that at a certain instant of time it hits a wall determined by 
the plane x — d> Q (cf. Figure 12). When this happens, the following constraint 
is instantaneously imposed on the system, 

ip ^ y — rujz ~ . 

Therefore, we are in the situation explained in Remark |7| The configuration 
space of the system is M = M+ = {a; < d}, with the boundary iV = {a; = c?}, 
and the linear constraint submanifold C = C+ C T*M is given by a^f (C) = V, 

V- s,nlr— — -r— ——\ 
^ \ dx dq^ ' dy dq^ ' dq^ J 

The expression for the projector V : T*M ^ C is given by equation H13|) . The 
submanifold giving the instantaneous constraints along N is 

^inst ^ £ I i;{aH{X)) = 0}. 
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X — d 



Fig. 2. A rolling sphere that eventually hits a wall. 



The projector V" 



T*M C 



inst 



IS 



-r\y + Ai - A3 
r2 + 2fc2 



rdy + k^dq^ - k^d<f'^ 



(14) 



Let A = (xo, yo, go, go, (Pi,)o, (Py)o, (pi)o, (^2)0, (^3)0) £ C+nT^M c r*Af 
be a plus-out point, i.e., Cy(A, dx) < 0. We first consider an elastic impact. Since 
H- = 00, we only compute the outcome of a reflective step. According to (|10l) . 
the points in the constrained characteristic passing through A within the same 
energy level are 



A and \ + c+^2'P{dx), with c+,2 = -2 — — — {px)o ■ 

The associated focusing points are given by 

and ^'"''^(A) +c+,27''""'(^(rfa;)). 



(15) 



Note that V''^''^ {V {dx)) = 7''"^'(dx) = Vidx), and therefore the points in lO be- 
long to the same constrained characteristic and to the same i?4.-energy level. De- 
noting the coordinates of the point 7''"^'(A) by {x,y,q^ ,q'^,Px,Py,Pi,P2,P3), 
we get 



Px = {Px)o 
Py = 



2fc2 



Pi = -k 
P2 = k 



2(r2 + P)(p^)o + r(p3)o 



2 iPx)o 



r(r2 + 2fc2) 



P3 



.^{r^ + k^)iPy)o + r{p3h 



r{r^ + 2/fc2) 



Now, notice that P'"''*(A) is a plus-out point, because 5('P'"'''(A), dx) = 
Q{X,dx) < 0. Therefore, following Proposition |B1 we conclude that the sequence 
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{(A, +), r'"^''^{X)+c+.2V'"^^^{dx),+)} is A-admissible, and V'"^^\X)+c+^2V'"^^\dx) 
is a decisive point. The other possible A-admissible sequence corresponds to 

{(A, +), P'-*(A), P"-'(A) + c+,2V'^''{dx), +)} , 

but renders the same decisive point. 

In the case of an inelastic impact, Proposition [7| yields that the unique A- 
decisive point is 7""^*'^(A). After the impact, the ball continues its motion along 
the wall under the dynamics specified by the vector field x™''^' . 

6.3. A rolling sphere on a rotating table. Consider again a homogeneous sphere 
of radius r and unit mass. Assume that the sphere rolls without sliding on a 
horizontal tabic which is rotating with certain constant angular velocity about 
a vertical axis through one of its points (see Figure |2J|. Let and fi+ be two 
angular velocities. Here we consider the following situation: each time the sphere 
reaches the hypersurface x = j/, an impulsive force is exerted on the table to put 
it spinning with a different angular velocity. That is, if the angular velocity of 
the table was the force applied on its rotation axis changes it to and 
vice versa. We assume that < f2+. This can be modeled as thinking of a 
system which is subject to two different affine constraint distributions. In order 
for this model to be consistent, we also assume that the surface of the table is 
rough enough so that sphere is rolling without slipping at all times. 




Fig. 3. A rolling sphere on a rotating table. 



The Lagrangian is again given by equation H12|) . The nonholonomic con- 
straints are now affine in the velocities, 

i — rujy — —fiy, y -)- rux = fix . 

The constraint space auiC) is then described by 

{ d d d d d ^ 
aniC) ^V + Y = span + -r— + —,— \+Y, 

[ ox oq'^ ay oq^ oq'^ J 

where Y is the vector field defined by 
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Note that the projection of T = ^LiY) to Ann(X') is given by 

= 9 , , 9 (-ydx + xdy - xrdq^ - yrdq^) . 
r'^ + ^ ' 

FoUowing the discussion for the case of afhne constraints, given y G T*M , the 
focusing point with respect to C± is given by 

X = V{y) + Q(r±) , 

where V is the projector in 

Assume that the sphere is roUing on the hyperplane M_ = {x < y} and that 
the constant angular velocity of the table is ]?_. Consider the case when the 
sphere "hits" the hypersurface N = {x = y} with the impact state 

A = {xa,ya,ql,ql,ql, {px)o, {py)o, (pi)o, (p2)o, (j33)o) e C_ = C° +r_ . 

Denote the coordinates of the associated focusing point by 

X = P{X) + Q(T+) = {x,y,q^,q'^,q^,px,py,pi,p2,P3) . 

Then 

Q{df, x) ^p^-py ^ {p^)o - {py)o + ^2^fe2 ixo + yo)i^- - n+) ■ (16) 

Given that A is an minus-out point, we have that G{df, A) — {px)o ~ iPy)o > 0- 
If xq = yo < 0, then the second term in (|16|l is also positive, and {(A, — ), (x, +)} 
is the unique admissible sequence for A. In this case, x is the A-decisive point. 
On the contrary, for certain values of xq — yo > 0, it might happen that Gidf, x) 
is negative, i.e., that a; is a plus-out point. Now, note that the focusing point 
associated with x is A itself, since 

Vix) + Q(r_) = ViViX)) + Q(T_) = P(A) + Q(r_) = A. 

As a consequence, in this case there would not be any A-decisive point. This 
problem stems from the fact the modeling of this example as a system subject 
to afhne constraints does not take into account that the jump in the angular 
velocity of the table takes place no matter what. Therefore, after the impact, 
we should really regard C+ as the new set of affine constraints acting on the 
whole configuration manifold. With this interpretation, x would obviously be a 
plus-in point (and hence decisive). In other words, the trajectory of the ball gets 
reflected back by the blow of the greater velocity J7+ . 

6.4- A two-wheeled system with a rod of variable length. Consider a system com- 
posed of two wheels of different radii, ri < r2, connected by a massless rod of 
variable length £ (see Figure 0J . For simplicity, assume that the two- wheeled 
system moves along a line, and that both the masses and the momenta of inertia 
of the wheels are unitary. The wheels are subject to the standard constraints 
of non-slipping. Assume that the length £ of the rod is constrained between a 
minimum length a and a maximum length b. Here we consider the following two 
situations: (i) when the length £ of the rod becomes extreme, an elastic impact 
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Fig. 4. A two-wheeled system with a rod of variable length. 



occurs; (ii) when the length £ of the rod becomes extreme, an arresting device 
fixes it, and therefore an inelastic impact occurs. 

The Lagrangian of the system is given by the kinetic energy of the wheels 



1 

^^2 



X2 



The conditions of rolling without sliding are encoded in the constraints 

ii - ri9i = , ±2 - 7-2 6*2 = , 

which, since we are considering the motion of the two-wheeled system only along 
a line, turn out to be holonomic. The constraint on the length of the rod is given 

by 



a < i ^ V (r2 - ri)2 + {x2 - a;i)^ < b . 

Following Remark □ we set M_ =0, M+ = M = {(xi, X2, 6*1, 6*2) e x x 
§1 I a < i{xi,X2, 6*1, 6*2) < b}, with boundary set N = dM = a;2, 6*1, 6*2) e 
X §^ X I t{xi^ a;2, 6*1, 6*2) = a or (.{xi,X2, ^i, 6*2) = and linear constraint 
submanifold C = C+ C T*M given by aff(C) = V, 



V 



span < ri 



d_ 

dxi 



d 



d 



dOi ' ^ dx2 



d 



The expression for the projector V : T*M ^ C in local coordinates is given by 
the following matrix 



( 


rl 





n 





\ 




l+r{ 







^1 













l+ri 






r\ 





1 











l+ri 




{ 












) 


l+ri 


l+ri 





Let A G T*M+ be a plus-out point with ^(A) = b and giX,di) > 0. Since = 
c», we only compute the outcome of a reflective step. Following equation (|10() . 
the points in the constrained characteristic passing through A with the same 
i/-i--energy level are A and A -I- c+.2'P{M), with 



C+,2 



' g{V{d£),V{d£)) 



(17) 
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According to Proposition IHl the point A + c-^-,2'P{d£) is +-decisive. 

Consider now an inelastic impact, i.e., the case when the length £ of the 
rod becomes fixed after the impact. Since there are no additional instantaneous 
constraints imposed on the system at the impact state, we compute the decisive 
points with regards to the boundary of the constraint manifold, 

= C CiaJj^^iTdM) = {{xi,X2,Oi,d2,Pxi,Px2,Pei,P02) e T*M \ p^^ = Px2, 
Pxi = ripe^, px2 = r2Pe2, i{xi,X2, 01,62) = a or £{xi,X2, 01,62) = b} . 

As before, we only compute the outcome of a reflective step. Following Propo- 
sition [7| we deduce that the unique decisive point is V^{\). After the inelastic 
impact, the length of the rod is fixed forever after, the velocities of the two wheels 
of the system are reset according to V^{\) and evolve according to X^^ ^ 
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